My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.io·1d·
Discuss: Hacker News
🎨Vulkan
Flag this post
Disciplined Biconvex Programming
arxiv.org·2h
🔀Parallel Algorithms
Flag this post
Synopsys and NVIDIA Forge AI Powered Future for Chip Design and Multiphysics Simulation
semiwiki.com·17h
🎮Game Engines
Flag this post
onedraw — a GPU-driven 2D renderer
dev.to·1d·
Discuss: DEV
Shader Programming
Flag this post
A hitchhiker's guide to CUDA programming
seanzhang.me·4d·
Discuss: Hacker News
🔢SIMD
Flag this post
flowengineR: A Modular and Extensible Framework for Fair and Reproducible Workflow Design in R
arxiv.org·2h
🦀Rust
Flag this post
Looking for a partner to study graphics programming with
reddit.com·19h·
Discuss: r/gamedev
Shader Programming
Flag this post
Show HN: a Rust ray tracer that runs on any GPU – even in the browser
github.com·17h·
Discuss: Hacker News
🔺Mesh Shaders
Flag this post
Co-Simulation Framework for Parallel DNN Execution on Chiplet-Based Systems (UW–Madison, Washington State)
semiengineering.com·10h
🔢SIMD
Flag this post
Design of quasi phase matching crystal based on differential gray wolf algorithm
arxiv.org·2h
Shader Programming
Flag this post
CHIP8 – writing emulator, assembler, example game and VHDL hardware impl
blog.dominikrudnik.pl·10h·
Discuss: Hacker News
🔩Assembly
Flag this post
I just trained a physics-based earthquake forecasting model on a $1000 GPU
news.ycombinator.com·7h·
Discuss: Hacker News
🎮Game Engines
Flag this post
Troubleshooting multi-GPU with 2 RTX PRO 6000 Workstation Edition
reddit.com·21h·
Discuss: r/LocalLLaMA
🎮Game Engines
Flag this post
Can-t stop till you get enough
cant.bearblog.dev·1d·
Discuss: Hacker News
Programming
Flag this post
H-FA: A Hybrid Floating-Point and Logarithmic Approach to Hardware Accelerated FlashAttention
arxiv.org·2h
🧠Memory Management
Flag this post
Deep Integration and the Convergence of Model Architecture and Hardware in AI
dev.to·1d·
Discuss: DEV
🔧FPGA
Flag this post
Programming for Computations: Matlab/Octave
link.springer.com·1d·
Discuss: Hacker News
🔢SIMD
Flag this post
Why Multimodal AI Broke the Data Pipeline — And How Daft Is Beating Ray and Spark to Fix It
hackernoon.com·1d
📊Performance Tools
Flag this post
Geonum – geometric number library for unlimited dimensions with O(1) complexity
github.com·17h·
Discuss: Hacker News
🔢SIMD
Flag this post
Learning Sparse Approximate Inverse Preconditioners for Conjugate Gradient Solvers on GPUs
arxiv.org·1d
Shader Programming
Flag this post